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SYMBOLS 


d minimum (Hamming) distance between code words in a code 
E(x) error polynomial 

e order of an element in GF(2") 

ei error value at location of received codeword 

G generator matrix of a code 


GF(2™) Galois field of 2” elements 


g(x) generator polynomial of a cyclic code 

H parity-check matrix of a code 

k number of information symbols in a code 

M(x) message polynomial 

m a positive integer 

m(x) minimum polynomial 

n code length (total number of symbols in a code) 


(n,k) linear block code with parameters n and k 


P number of errors in received codeword 

p(x) primitive polynomial 

q a positive integer defines the numbers of element in the Galois field 
R(x) received polynomial 

r(x) | remainder polynomial 

i) syndrome of a parity check 

5, a Syndrome component calculated by root substitution method 

Ss x a syndrome component calculated by R(x)/g(x) 

t error correcting capability of a code 

V(x) transmitted polynomial 
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error location at jth position of received codeword 
primitive element of GF(2™) 
error location number 


error location polynomial 
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A DECODING PROCEDURE FOR THE REED-SOLOMON CODES 
Raymond S. Lim 


Ames Research Center 
SUMMARY 


This paper describes a decoding procedure for the (n,k) t-error- 
correcting Reed-Solomon (RS) code, and an implementation of the (31,15) RS 
code for the I4-TENEX central system. This code can be used for error cor-~ 
rection in large archival memory systems. The principal features of the 
decoder are (1) a Galois field arithmetic unit implemented by microprogram- 
ming a microprocessor, and (2) syndrome calculation by using the g(x) 
encoding shift register. Complete decoding of the (31,15) code is expected 
to take less than 500 psec. The syndrome calculation is performed by hard- 
ware using the encoding shift register and a modified Chien search. The 
error location polynomial is computed by using Lin's table, which is an inter- 
pretation of Berlekamp's iterative algorithm. The error location numbers are 
calculated by using the Chien search. Finally, the error values are computed 
by using Forney's method. 


PROLOGUE 


o 


Historically, in the development of algebraic coding theory, the Reed- 
Solomon (RS) codes were recognized as the most powerful and elegant, and also 
the most complicated, block codes to decode. These codes were first described 
by I. S. Reed and G. Solomon in 1960. A systematic decoding algorithm was not 
discovered until 1968 by E. Berlekamp. Because of their complexity, the RS 
codes are not widely used except when no other codes are suitable. 


In 1968, a (63,53) RS code with code symbols from’ GF(2°) was used to 
salvage the IBM Photo-Digital Storage System. In 1975, a (4095,4001) RS code 
with code symbols from GF(2!2) was used to make the magnetic tape archival 
data storage system viable at the Bureau of the Census, Department of Com- 
merce. In 1975-1976, a (31,15) RS code with code symbols from GF(2°) was 
designed by E. Berlekamp for use in a classified defense communication system. 
Before proceeding with a description of decoding the RS codes, it is felt that 
the following poems selected to amplify the coding spirit are appropriate. 


In Galois Fields* 


In Galois fields, full of flowers 
primitive elements dance for hours 
climbing sequentially through the trees 
and shouting occasional parities 


The syndromes like ghosts in the misty damp 
feed the smoldering fires of the Berlekamp 

and high-flying exponents sometimes are downed 
on the jagged peaks of the Gilbert bound. 


Message and Clarity? 


A message with content and clarity 
Has gotten to be quite a rarity. 

To combat the terror of serious error, 
Use bits of appropriate parity. 


INTRODUCTION 


With present technology, very large memory systems (21012 bits) designed 
for the archival storage of digital data are critically dependent on elec- 
tronic error correction systems (EECS) for ensuring system viability and 
integrity (refs. 1, 2). In the IBM 3850 Mass Storage System, the EECS used 
is an Extended Group Coded Recording capable of correcting up to 32 8-bit 
bytes of data in a 208-byte data block. In the CDC 38500 Mass Memory System, 
the EECS used is a modified Group Coded Recording similar to that used in the 
IBM 2400 Series magnetic type systems. 


The use of magnetic tape systems for archival storage of digital data 
depended even more critically on EECS to make them viable. The EECS devised 
by Brown and Sellers, which was used in the IBM 2400 series magnetic tape 
system, is not adequate for long-term archival storage of data (refs. 3-5). 


At the Institute for Advanced Computation (IAC), archival storage systems 
such as the UNICON 690 (or UNICON 190), magnetic tape systems, and other mass 
memory systems are no exceptions. The viability of these systems depends 
critically on EECS. For the Institute, instead of designing a different EECS 
tailored to each particular archival system, it is advantageous to design 
just one EECS powerful enough to serve all systems within the Institute. 


*By S. B. Weinstein of Bell Telephone Labs., IEEE, Trans. on Inf. Theory, 
March 1971, p. 220. : 


tFrom Error Correcting Codes, H. B. Mann, Ed., Wiley, N.Y., 1968. 


This paper describes the (n,k) t-error-correcting Reed-Solomon (RS) codes 
and a decoding procedure suitable for implementation with the present technol- 
ogy. In particular, a (31,15) RS code is chosen as the EECS for the IAC I4- 
TENEX central system. This code is not a binary code, but a q-ary code with 
code symbols from GF(2°). It is believed that this code is powerful enough 
to. meet all anticipated IAC requirements. Because of the PDP-10, the PDP-11, 
and the ILLIAC IV computers comprising the I4-TENEX system, this code is 
planned to have two modes of operation: 36-bit mode and 16/8-bit mode. The 
decoding of this code will be implemented by hardware and firmware, and con-. 
sists of four steps: (1) the syndrome calculation is performed by hardware 
using the encoding shift register and a modified Chien search; (2) the error 
location polynomial computation is performed by firmware by microprogramming 
a 2900 series microprocessor to implement the Berlekamp iterative algorithm; 
(3) the error location numbers calculation is performed by hardware using the 
Chien search method; and (4) the error values computation is performed by 
firmware using a method suggested by Forney. Finally, this (31,15) RS code 
EECS is interfaced to the I4-TENEX system by means of a standard IAC 1011- 
Interface, like the Q1011 (ref. 6). With this interface, this EECS is just 
another processor in the I4-TENEX central system. 


The author wishes to thank Professor Shu Lin of the University of Hawaii 
for his initial consultation and for reading this paper. He also thanks his 
colleagues D. K. Stevenson, G. F. Feierbach, and P. Hiss for reading and com- 
menting on the work reported herein. 


REED-SOLOMON CODES 


The Reed-Solomon (RS) codes (refs. 7-9) are the most powerful of the 
known classes of block codes capable of correcting random errors and multiple- 
burst errors. The RS codes are nonbinary codes with code symbols from a 
Galois field of q elements GF(q). From coding theory, if. p is a prime 
number and q is any power of p, there are codes with code symbols from a 
q-symbol alphabet. These codes are called q-ary Bose-Chaudhuri~Hocquenghem 
(BCH) codes. 


For engineering applications at the present, only binary codes derived 
from RS codes are of interest. For this reason, GF(q) will be restricted to 
GF(2™), where m is a positive integer. The field GF(2™) is formed by a 
primitive polynomial of degree m with a as the primitive element of the 
field. In the algebra of a Galois field, a is also called the nth _ root of 
unity in GF(2™) since of = 1 for n = 2M- 1. With q = 2™, the code symbols 
of an RS code are at, i=, 0,1, 2,..., 2M - 2, which are the 2™ distinct 
elements of GF(2™). The notation a” = 0 is used here. 


Let mg, d, s, and t be any positive integers, and a be an element of 
GF(qS). There exists a q-ary BCH code of length n = qS - 1 symbols that 
corrects any combination of t or fewer errors and requires no more than 2st 
parity~check symbols. Let g(X) be a polynomial of lowest degree over GF(q) 
and select q = 2". The code generated by g(X) is a cyclic BCH code and has 

oO | gtotl | ; qtotd-2 (1) 


o ee 


as roots. The special q-ary BCH code,’ for which s = mj = 1 and d= 2t +1, 
is called the RS code. The roots of the RS code are 
a,at,a%,...,a2t (2) 


Since the minimum polynomial with root of ao is simply (xtal), the genera- - 
tor polynomial g(X) of a t-error-correcting RS code of length 2" - 1 is 


g(X) = (X+a)(X+a2) 2... (XK +02°) (3) 


The codeword polynomials generated by g(X) consist. of the multiples of 
g(X) modulo X® +1, and have a, a%,a59,...,0 as roots. Since g(X) has 
degree 2t, and o is a primitive nth root of unity in GF(2™), the RS code 
generated by g(X) is a t-error-correcting cyclic code with the following 
parameters: 


Code length (symbols): n= 27-1 
Number of parity check symbols: n—- k = 2t 
Minimum distance: d= 2t +1 


>m 


Number of information symbols: k=2 - 1 - 2t 


Since a symbol in GF(2™) can be expressed as an m-tuple over GF(2), the 
parameters of an RS code over GF(2) are: 


n = m(2™-1) bits 
n-k = 2mt bits 
2t+ 1 
k = m(2™-1-2t) bits 


Qu 
Ih 


In coding theory, if g(X) is a polynomial of degree n-k and is a fac- 
tor of xX" + 1, then g(X) generates an (n,k) cyclic code (ref. 7, theorem 
4.3; ref. 8, theorem 8.1). One way to show that an RS code generated by g2(X) 
is cyclic is to describe the code in terms of its roots of g(X) in GF(2M). 
The order e of a field element a+ is the least positive integer for which 
(at)® = 1. Since (a7)" = 1, e must be a factor of n= 2-1. If e 
divides n, then (X©°+1) divides (X"+1). Furthermore, an element al of 
order e must be a root of (X°+1), then (X+a1t) divides (X©°+1), and hence 
it divides (X"+1). Therefore, (X"+1) has as roots all the n= 27 - 1 
nonzero elements of GF(2™). Since g(X) has al, a2, a3, a8 2, at as roots, 
@(X) is a factor of (X®+1), and hence g(X) generates a cyclic code. 


ERROR CORRECTING CAPABILITY 


The RS codes over GF(2™) are very effective for correcting random and 
burst errors. Since each code symbol is an m-tuple (or m-bit symbol) over 
GF(2), a t-error-correcting RS code is capable of correcting any error pat- 
tern that affects t or fewer m-bit symbols. For example, since a burst of 
length 3mtl cannot affect more than four m-bit symbols, a four-symbols cor- 
recting code can correct any single burst of length 3m+tl or less. It can 
also simultaneously correct any combination of two bursts of length ml or 
less because each such burst can affect no more than two symbols. At the same 
time, it can correct any combination of four or less random errors. In gen- 
eral, the RS code with error correcting capability t can be used to correct 
any of the following errors: 


l. All single bursts of length bj , no matter where it starts, if 
b) <m(t-1) + 1 

2. Two bursts of length no longer than bj each, no matter where each 
burst starts, if by, < m([{t/2] -1) + 1, or any p_ bursts of length 
no longer than bp each, no matter where each burst starts, if 


bp < m([t/p] -1) + 1 


From the above discussion, it follows that the RS code can be used to correct 
random errors, Single-burst errors, or multiple-burst errors. 


CODE SELECTION 


The RS codes offer the designer a wide range of code parameters. In 
coding theory, a block code with parameters n and k is denoted as (n,k). 
In table 1, a list of RS codes is tabulated for m equal to 4, 5, and 6 with 
t ranging from 2 to 10. For the IAC I4-TENEX system consisting of computers 


5 


TABLE 1. - LIST OF REED-SOLOMON CODES 


Te ee [are poe Dee 


(15,11) 44 
(15,9) 
(15,7) 
(15,5) 
(15, 3) 


(31,27) 
C31 525) 
(31,23) 
(31,21) 
(31,19) 
(31,17) 
(31,15) 
(31,13) 


(63,55) 
(63,53) 
(63,51) 
(63,49) 
(63,47) 
(63,45) 
(63,43) 


OMAN DUFF OMNDUN PWN BDU WND 


— 


with word lengths of 16 bits (PDP-lls), 36 bits (PDP-10s), and 32/64 bits 
(ILLIAC IV), the best choice for fitting these word lengths is the (31,15) 
code. The formats for the 36-bit and the 16/8-bit are shown in figure l. 


BIT 0 1 2 36 37 38 #39 . . . #73 74 #%7 76 154 
NOT — ONE PDP—10 WORD ONE PDP—10 WORD PARITY—CHECK BITS 
USED ; 


(a) Fitting of two PDP-10 words into 75 bits. 
Bit-@ is not used (always equal to zero). 


0342. . 6 7-23 24-40 41-57 58-74 75 — 154 
ERG Ae ee ee oe Oe ~<a 
NOT USED B, B, Bo B, PARITY—CHECK BITS 


(b) Fitting of four PDP-11 words into 75 bits. 
Bi =9, 1,2, and 3) is a 16-bit word plus parity. 


Figure 1. - Data formats of the (31,15) RS code for 36-bit and 16/8-bit 
modes. Unused-leading bits are always zero. 


ENCODING 


There are two methods for encoding linear cycle codes: the serial shift 
register method and the parallel matrix method. Let M(X) be a message poly- 
nomial with k symbols encoded into a code polynomial (codeword) V(X) with 
n symbols. In the serial shift. register method, encoding in systematic form 
is accomplished by dividing xn-k w(x) by g(X) and appending the remainder 
r(X) to x®-kycx). That is 


v(x) = r(x) + x * cx) = q(x) 2(X) (4) 


where q(X) is the quotient. This indicates that (r(x) +x®7k ux) ] is a mult- 
iple of g(X) and, therefore, is a code polynomial generated by g(X). The 
codeword generated is ‘ 


(ror iro eee ee ae | MoM) .. - m1) 


message 


a parity check -| 
bits 


bits 


and the most significant symbol of the message, m,_j, is sent first. 


In the parallel matrix method, the generator matrix G has the form 


G = } (5) 


Pex (nek)? Ik xk 
where P is a k~x(n-k) matrix generated by retaining the remainder of 
yack 

g(X) 


and I is a kxk identity matrix. The encoding of the message vector M 
to a code vector V is 


Re ee ey es es | (6) 


V = MG © (7) 


The (31,15) RS code has n = 31, k = 15, m= 5, and t = 8. The primi- 
tive polynomial p(X) = X> + X2 + 1 can be used to generate the 31 nonzero 
elements of GF(2°), as shown in table 2. The generator polynomial g(X) 
from equation (3) is 


g(X) = (Xt+a)(X+a2) .. . (X+a!5) (X ta!&) (8) 


Multiplying out the terms of g({X), the general form of g(X) is 
15 i 
2g(X) = xi6 + 22 a, x = xlé + a)5 x! + a,,X!*+.. ~tay X + ag (9) 
°i=0 


The evaluation of a; in equation (9) is straightforward but extremely tedi- 
ous. In order to avoid errors, a computer program should be used in conjunc- 
tion with table 2 for such evaluation. 


The implementation of equation (9) for encoding using the parallel 
matrix method is a matrix G as shown in figure 2. Each Oj,;] %$in the 


matrix G is a field element in GF(2°). The encoding.of M(X) into V(X) is 
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o on OF Ww fF Ww | 


TABLE 2. - GALOIS FIELD OF 2°. 


RX 22g rRro 


eke 


2 
3 
4 
5 
6 
7 
8 
9 
0 


RFPreOOrF OF OCOOFrFOCCOOC oO 
KSHeK RFP OOK F OF OCOrKF OOOO 
Kee eK OOrF OF OCOrFOOCO 
O0OrFRrROCOrYF COr CCC OCOFOO 
rFOOrrF OroaOroaocorsd 
KHOR OEFHKHPRFP ORE OO ORH 
Be ie eS eS ie 
COFORFPOKRF FP ORFHOOOF 
KH OF FP RP OR KF OOO Re ee 
Or OR KF rR OF KF OOO RH ee 


Note: Elements generated by p(a) = a> + a2 +1 
For example, a” = (10100) means a’ =a 


P | I 


0 1 2 3 4 5 6 - 7 8 9 10 nN 12 13 45 {'® 17 1B eee 28 29 30 


9,0 0,1 0,2 9 . 99.43 %,14 %,15 [1 0 9 eee 
gd a ; “es _ 21,13 %,14%1,15]9 1 0 see 
92,0 92,1 92,2 7S @ 139,14 %2,15|0 9 1 eee 
e e e | e ° 
° eee e | ° eee 
e e “e | eo. e 
12,0 “12,1 %12,2 eee M2 3 12,142,151 000 eee 
43,0 13,1 913,2 <o8 3 3 %13,14913,1519 0 0. 8 ee 
14,0 14,1 14,2 es oy B ow.eH. re 000 eee 


Figure 2. - Generator matrix G of (15,13) Reed-Solomon codes; 


as j is a field element from GF(2°). 


oo 2 


V(X) = (momm... -M_» m,_,)¢ (10) 


where the message symbol m,_,} is the most significant digit. From figure 2 
and equation (10), the parallel matrix encoding method requires about 

15 x16 = 240 Galois field multipliers. Because of the logic complexity, this 
method is generally not used except for very high-speed applications, or when 
G is very simple. For example, consider a 32-bit memory system built by 
using n words by 4-bit integrated-circuit, random-access memory (IC RAM) 
chips. In such a memory system, a single chip failure will result in a 4-bit 
error in the 32-bit word. A t = 1 (15,13) RS code with code symbols from 
GF(2") can be used for error correction in such a memory system. The G 
matrix of this code requires about 13 x2 = 26 Galois field multipliers. Using 
current technology, these multipliers can be implemented by table lookup using 
read-only memory (ROM) chips. 


There are two shift register methods for encoding linear cyclic codes. 
One method uses a (n-k)-stage shift register, and the other uses a_ k-stage 
shift register (ref. 7). In practice, the (n-k)-stage shift register is most 
commonly used unless n-k is much greater than k. For encoding the (31,15) 
RS code, a (n-k) = 16 stages shift register (fig. 3) can be used to implement 
equation (4). The feedback multipliers ag, 4,,...,%15 are field elements 
of GF(2™). Each R; register stage is a 5-tuple shift register. The 
operation of the encoder is as follows. With Sl set for feedback and S2 set 
to position 2, k information symbols are shifted into the encoder and simul- 
taneously sent to the channel. Then S1 is turned to disable the feedback and 
$2 is turned to. position 1; the 16 parity-check symbols stored in the encoder 
now are shifted out to the channel, clearing the shift registers. 


Sl 


V(X) 
TO CHANNEL 


Figure 3. - Encoder for (31,15) code; aj is a field element from 
GF(2°) and R; isa 5-tuple shift register stage. 


DECODING 


In this section, an error correction procedure is described for the (n,k) 
t-error-correcting RS codes, and a design implementation is shown for the 
(31,15) t = 8 RS code. As described earlier in encoding, RS codes are non- 
binary codes. Therefore, the decoding procedure involves finding not only 
the error locations, but the error values as well.’ A t-error-correcting RS 
code generated by g(X) is a cyclic code, and the codewords consist of all 
multiples of g(X). The degree of g(X) is 2t and its roots are a,a’,..., 
a2, Since every codeword is some multiple of g(X), a, a%,..., 02% are 
also roots of every codeword. 


Let V(X) be the transmitted codeword, E(X) be the channel noise error 
pattern, and R(X) be the received codeword represented as follows: 


_ V(X) = vo typ x + vo x* +...4+ ve une 
E(X) = eg te, x + ep x2 ee el gee 
R(X) = rg,t vr, x + rox* +...+ r-l suet 


where vj, e;, and r;, i = 0, 1,2, ...,n-l, are elements of GF(2™). At the 
decoder, 


R(X) = V(X) + E(X) (11) 


The error pattern’ E(X) can be described by a list of values and locations of 
its nonzero components. For the decoding procedure to be described, the 
error location will be given in terms of an error-location number, which is 
simply aJ for the (n-j)th symbol. Let xX, be the error location number 
and e; be the error value. Then for each nonzero component of E(X), a 
pair of field elements (xj, ej) is required to describe that error. If E(X) 
has p errors, then p pairs (x:, e;) are required to describe the errors. 
Any decoding procedure is a procedure for locating these p pairs of (x3, e;) 
if pst. 


Assume that E(X) is an error pattern of p errors at locations jj, 

jas -- ->Idp- Then, ~ 
E(X) = ei, xl 4 ie +...u.t eis x/P (12) 

where p S$ t and 0 < j, < jg <...< ip <n-l. The first step in decoding is 
to check whether V(X) is a codeword by calculating the syndrome. If the 
syndrome is zero, then either V(X) actually has no errors or V(X) has an 
undetectable error. In either case, V(X) is accepted as no error. A nonzero 
syndrome indicates that an error has been detected; the error may or may not 
be correctable. For the RS codes, the syndrome is defined as a vector §S 
with 2t components as follows (refs. 7, 8): 


S.= R(a~) = fp FF] a + r> (al)? to. et fan (at)e (13) 
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for i= 1, 2,3, ...,2t. Combining equations (11) and (13) gives the result 
S, = V(a") + Ela’) (14) 


Since V(a") = 0, 
i . i 
2=1 
Combining equations (12) and (15) gives the result 
So Segk Hed Ty ee tad eye ial) 
Si e;, (a )" + e jo (a yo t.. .t Se P) (16) 
for i= 1,2,..., 2t. Expanding equation (16) gives the result 


“te NT etn 58 ade 
8] e5, 4 + e5,% eee eae 


J1y2 ey ae se ae - (adpy2 
, (o - + ej, (a a or ee 25,80 P) 


° 
e 


; = fd byt - (gd2)2t (qi py2t 
ot e;, (a ) + ej, (a ) tee at e350 ) 


1p) 
N 
HT] 
iv) 
uu 


(17) 


S 


Equation (17) is a set of 2t nonlinear equations which relates the 2t 
known quantities of Sj to 2p unknowns. consisting of p unknown locations 
and p unknown error values. Any error correction procedure is a method of 


solving this set of equations for the p _ pairs of (E59> a2), 2 1, 2,. 64,5 


p. Once aJ!, oJ2,..., 0/P are found, the powers 41, j2,+ ++» ip will indi- 
cate the error locations in E(X). In general, there might be many error pat- 
terns that satisfy the 2t equations. If p < t, then the error pattern 
with the smallest p is the actual error pattern. 


For notational convenience, let 


’ 


B =a Ik, g=1,2,...,~ (18) 


be the error location numbers (refs. 7, 8). Now equation (17) can be written 
as 


Sy eye ees Bo +...+ es 


32 Jp “P 
_ 2 5 2 
So ej, By ey, Bo +t... t+ ° 4p Bp (5) 
2t 2t a 8 
a Pas A aa i a as 


These 2t syndrome components are symmetric functions in 8), 89,..--, 6B, 
which are known as power-sum symmetric functions. Next, let the error loca- 
tion polynomial be defined as follows: 
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o(X) = (1+8, X)(1+85X).. - (1 +B, X) 
= 09 to, X+ 2X? +...+ 0, xP (20) 
where 
69 = l 
6) = B) + Bo +... + B, | 
o2 = By; B2 + 8B} 83 +-- -+ BL) B, 
03 = By 8283 + Bi B23 4+...+ Bo-2 Bo 8, (21) 


0, = By B283.- » By 


The roots of o(X) are 81, 8o 7, .- ighg Bad which are the inverses of the 
error location numbers. It can be seen from equations (19) and (21) that 
‘the coefficients of o(X) are related to the syndrome components S,;, i = l, 
2, ...,2t. The coefficients 06], 062,...,6, are known as elementary sym- 
metric functions of 81, Bo,.--+» Bp. Therefore, if it is possible to find 
o(X) from the syndrome components, the error location numbers can be found 
and the error pattern E(X) can be determined. In the following, an effec- 
tive decoding procedure is described and a design implementation for the 
(31,15) RS code is given. This procedure consists of four major steps as 
follows (refs. 7, 8). 


- Calculate the syndrome S = (Sj, So,...; Sot) from R(X) 
- Calculate the error location polynomial o(X) from S 


J 


. Calculate the error value e, from X, and §S 


1 

2 

3. Determine the error locations X; by finding the roots of o(X) 
i J 


STEP 1. SYNDROME CALCULATION 


There are two methods for calculating the syndrome (S): the standard 
method, and the shift register method. -The standard method uses equation 
(13) because this is the way the syndrome is defined. The shift register 
method uses the g(X) encoding shift register. 


In the standard method,, the syndrome calculation given in equation (13) 


ae n-1 
8} = rg + rj a! + rp a2 a ee r-1% 
2(n-1 
So = rg + rj a* + ratte. .t r-1% (a1) 
(22) 
3 6 3(n-1) 
$3 = Yop tr, a> t+ roa, t+... t+ Ta-1% 
t ~- 
Sot = rg t+ ry a” + rate caer ae wi ri-l aoe v 
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-In matrix form, equation (22) can be written as 
(at)J 
S = RH = (roti r2---T 4 4) 1-31, 2333 ee Zt (23) 
, 0,1,2, ...,n-1: 


J nx2t 

where H! is an n rows by 2t columns matrix. For the (31,15) t = 8 RS 
code, H! is a 31x16 matrix shown in figure 4. An all-parallel syndrome 
calculation would require about 480 multipliers and 16 3l-input modulo-2 add- 
ers in GF(2°), which is indeed very expensive. One compromise is a serial- 
parallel method using 16 circuits of the type shown in figure 5. These cir- 
cuits required 31 clock cycles to calculate S$, or about 1.55 wsec at the 
present STTL technology. _ 


1 2 3 4 5 eee 15 16 

0 1 1 T 1 1 eee 1 1 
1 a az a? a® a aecs at git 

2 a2 a® at a? ai? Pee a?? = 
3 a a’ a? a? a's eee ai* al? 
4 a*® a® e ° ° . e 
5 a até . e . e o 
6 aé a? ° . ° e e 
7? a’ al® ° ° ° Ps ° . 
8 a? ais - ‘ . tg ° ° 
ne 2 9 a a? ° . . e ° 
10 at? a?? < . ° . . 
WV ai! a?? ° . . ° 
12 gi? az* a? al? az? see a2? at 
e e e e e e e e e 
e e e e e e e e e 
e e s e s e se e es 
29 a7? a?’ a?5 a? a?? eee a a’? 
30 a? a2? a2? a2? azé cee ais als 


Figure 4. - Parity-check matrix for the (31,15) t = 8 
Reed-Solomon code. 
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ry (at) 
GF(25) 
MULTIPLICATION 
BY TABLE LOOKUP 


(at)J 
GENERATOR 
(TABLE LOOKUP) 


Figure 5. - Circuit for calculating the syndrome component Si- 
Approximately six STLL MSI ICs are required. 


For large t(t24), the best way to calculate the syndrome is to use the 
‘g(X) shift register. This will result in some saving in logic because this 
shift register is already used for encoding. ‘However, the S calculated by 
g(X) is not the same as the S$ calculated by R(at) of equation (13), but 
they are related. This relationship is described below. 


From equation (13), let 
S = (Si, S2,- + +s Sop) (24) 


be the syndrome calculated by R(a*) with 


S. = R(a’) (25) 
for i= 1,2,...,2t. Let 
— * 
SPS *) SA, 6.o <4 S55) (26) 


be the remainder calculated by dividing R(X) by g(X). The remainder S* 
is another form of the syndrome but S* # S. Using the Euclidean division 
algorithm, the result of R(X)/g(X) is 


R(X) = Q(X) g(X) + S*(X) (27) 
where Q(X) is the quotient and 
14 


2 2 * 2t-l 
S*(X) = S*¥ + SX + S¥X +...+ 8,5 % (28) 


is the remainder. Substituting X by a in equation (27) gives the result 


R(a‘) Q(at) ta) + $*(at) 


S* (a4) (29) 


It 


since Q(at) g(at) = 0. From equations (25), (28), and (29), the relationship 
between S and S* is 


i 
= Sx 
S; S*(a’) 
? F eed. 
= SF + S#(a*) + Sk(a7)2 +... + SK (a)? (30) 
for i= 1,2,..., 2t. The relationship of equation (30) can be implemented 


by the circuit shown in figure 6. This circuit is very similar to the Chien 
search circuit as described later in step 3. In figure 6, the 2t components 
of S can be obtained from S* in 2t clock cycles. 


SUM (= ) CIRCUIT s 


Figure 6. - Modified Chien search circuit for obtaining S. from s% 


In the above discussions, two basic methods for syndrome calculations 
were presented. For large t, it is concluded that the best way to calculate 
the syndrome is to divide R(X) by g(X) to obtain S*(X), and then obtain 
S; from S;* by a modified Chien search circuit. If R(X) contains no error, 
S*(X) = 0, and this calculation takes n clock cycles, which is the same 
speed as the serial-parallel method of figure 5. If R(X) contains an error, 
S; is obtained from S,*; then go to step 2, and so on. One significant 
point, which should be mentioned, is that this method of syndrome calculation 
requires practically no additional logics because both the g(X) shift regis- 
ter and the Chien search circuit already exist. 


Cy 
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STEP 2. o(X) CALCULATION 


A method for calculating the error location polynomial o(X) is given 
below without proof. This method is the Lin's table (ref. 7), which is an 
interpretation of Berlekamp's iterative algorithm (ref. 9). The same method 
can also be found in reference 8 in a slightly different form. 


To find o(X), start with the table 


~_() 

».¢ d R - 2 
oe ci | Sik SG 
i 1 1 0 = 
0 1 S) 0 0 

1 
2 


and proceed to fill out the table. Assuming that the table has all rows 
filled out up to and including the uth row, then fill out the (ytl)th row 
as follows: 
= (u+1) — ch) = 
1. If qd = 0, then a (X) =o and dtl = m 
2. If d, #0, find another row preceding the uth row, say the pth 
row, such that the number 0 - 2. in the last column of the table 
has the largest value and d, #0. Then 


and 
Bd = max (2, Rot 0] (32) 
In either case, 
= (ut+1) (u+1) 
qi = awe + Oy Stl bce Gk OR] Sut2-2 47 (33) 
where the a are the coefficients of go HED egy | 2 


oD xy 1 tO x + oY x2 4. 4 gD hut (34) 


If V(X) has exactly t errors, then the polynomial o (2) (x) in the last 
row is the required o(X). If V(X) has more than t errors, then o 2t) (x) 
has degree greater than t, and generally it is not possible to locate the 
errors. If V(X) has fewer than t errors, the table terminates into a mode 
prior to step 2t where d, = dy) = 0 and of) (x) = o 1) (x), 
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The computation for o {W) (x) and d combined, on the average, requires 
about 2t additions and 2t ite Apl ications for each step. Since there are 
2t steps, the total is about 4t? additions and 4t2 multiplications. For 
the (31,15) t = 8 RS code, at most 16 iterative steps are required to obtain 
o(X), and each step requires about 16 additions and 16 multiplications in 
GF(2°), plus inversions. At the present technology, an economic method for 
performing these computations is to have a Galois field arithmetic unit imple- 
mented by microprogramming a microprocessor like the 2900 series microproces- 
sor family. With a few hardware augmentations for inversions and special 
instruction controls, these Galois field arithmetic computations can be exe- 
cuted at 100 nsec per instruction. At this rate and allowing a 300% program 
overhead, the o(X) calculation would take about 100 usec. 


STEP 3. DETERMINATION OF ERROR LOCATIONS 


The error location numbers are the reciprocals of the roots of o(X). 
The roots of o(X) can be found simply by substituting 1l,a,a2,.. .,an-l 
into o(X). Since of = 1, then a7* = gM-%. Therefore, if oa*% is a root of 
o(X), a®-% is an error location number and the received digit Th-g is in 
error. If n is large, this substitution method is not desirable because 


the length of computation can.be long. 


If oy, i= 1,2,...,p, and p< t are known from step 2, and using 
the fact that RS codes are cyclic, a procedure credited to Chien known as the 
Chien search (refs. 7-9) can be used to find the error location numbers. The 
received codeword R(X) is tested on a digit-by-digit basis starting with the 
high-order digit r,_; first. To decode r,_;, the decoder tests whether 
on is an error location number. This is equivalent to testing whether a 
is a root of o(X). If a is a root, then from equations (20) and (21), the 
result is 


0, a + 0 0% to. et Le oP = 1 (35) 
If equation (35) is satisfied, then gh-1 is an error location number and 
the digit r,-; has an error; otherwise, r,_; has no error. To decode 
Ty-2> the decoder tests 


ony ot + op a2 +o. .t ue ae = 1 (36) 


If equation (36) is satisfied, then a is a root of o(X) and the digit 
Tp-g has an error; otherwise, rpg has no error. 


The error location numbers testing procedure of the Chien search can be 
implemented in a straightforward manner by a circuit such as that shown in 
figure 7. The t o-registers R,, R2,..., Rt are initially stored with 
O1» 525+ +-+, Of calculated in step 2. It should be noted that if p < t, the 
register stages §pel Ro+2> - - +, Ry are stored with zero since 
Opt] = Spt2 = eee = Ft 0. To test r,_,, the circuit is pulsed once. The 
multiplications are performed and 0) 4,074a°,...-,06,aP are stored in the 
o-registers. The output of the SUM circuit is 1 if and only if the sum 
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SUM ( £ ) CIRCUIT 


X;-REGISTER 
(n bits) 


Figure 7. — Chien search cyclic error location circuit. 


a, a+ 52 a2 SB soe SF o aP = 1 
Otherwise, the output of SUM is 0 or al, Having tested frp-}j, the circuit is 
pulsed again. Now 06} a*,o9a",. ..,0,02P are stored in the o-registers. 
The output of the SUM circuit is l if and only if the sum 


o1 02 +onoat+...4+ a a7P = 1 


. 


Otherwise, the output of SUM is 0 or a+. This process continues until all 

n digits of R(X) are tested. The SUM output is shifted to an n-bit Xi- 

register. For example, for n= 15 and if the pattern in the X;-register is 
0123456789 10 11 12 13 14 
0001001000001 0 0 

then the digits numbered 12, 6, and 3 of R(X) are in errors. 

For the (31,15) t = 8 RS code, eight 5-tuple register stages are 
required. The multiplication circuits of a, a“,...,a° can be implemented 
by table lookup using ROMs. The function of the SUM circuit can best be per- 


formed by the microprocessor since this function requires only eight XOR 
operations and a testing for "1" operation. 


STEP 4. CALCULATION OF ERROR VALUES 


2=1,2,..., p, can be calculated from equation 


The error values Cj? 
(19) once the syndrome components S1,S2,-..,5,, and the error locations 
numbers 8), Bo, - - +» Bp are known. The set of equations in equation (19) is 
linear and one method of solving it for e; is credited to Forner (refs. 


JL 
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7-9). It should be noted that the error location number By is equal to 


atk, and aJ2 = gi = a7%, where a* is a root of o(X) as described in 
step 2. Once the p pairs (X55 4.) for 2 = 1,2,..., p are known, E(X) 


in equation (12) can be determined. 
To find Cjg: let 


Z(X) = 1+ (S] +0))X + (Sp +0, S$} +o2)K* +... 
+ (S, +o, S,yto2S,o+--- +0,,)XP (37) 


then 


Z(B5 1) 
= + (38) 
4 -1 
T(1+B, By ) 
i=1 
ity 


pais 


The computation of in equation (38) is relatively complex. For a 


[jp 
t-error-correcting RS code, and assuming E(X) has p< t digits in errors, 
the computation of Z(X) requires p+p(ptl)/2 additions and p(ptl)/2 
multiplications for each digit in error. Let D be the denominator of equa- 
tion (38). The interpretation of D is that D should consist of P - l 
_ terms. The missing term in D is the term where 8; = Bg. The computation 
of D requires p - 1 additions and (p-1) + (p-2) multiplications for each 
digit in error. 


For the (31,15) t = 8 RS code, the computation of in equation (38) 


ig 
can be performed by the Galois field arithmetic unit implemented by micropro- 
grams in the 2900 series microprocessor. Assuming a 100-nsec instruction 
execution rate and a 100% program overhead, the computation for each error 
value will require about 20 usec, or about 200 usec for all eight error values. 


CONCLUSION 


A decoding procedure has been described for the (n,k) t-error-correcting 
Reed-Solomon (RS) codes. The code parameters are chosen such that the codes 
are cyclic so that encoding and some parts of decoding can be implemented by 
shift registers. In particular, a logic implementation scheme has been pre- 
sented for the (31,15) t = 8 RS code. The principal features are (1) a 
Galois field arithmetic unit implemented by microprogramming a 2900 series 
microprocessor, and (2) syndrome calculation by using the g(X) encoding shift 
register. This arithmetic unit is capable of executing instructions at a 
100-nsec clock cycle rate. Encoding the (31,15) code would be implemented by 
shift registers requiring 31 clock cycles (3.1 usec). Decoding the (31,15) 
code requires four steps of computation with a total decoding time of about 
500 usec. With present technology, it is felt that the microprogrammable 
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microprocessor is the most feasible approach to decode this code except for 
the case where n and t are small, for example, n < 15 and t s 2. For these 
cases, an all-parallel method is probably more suitable. 


Ames Research Center 
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